Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios
Identifieur interne : 002A84 ( Main/Exploration ); précédent : 002A83; suivant : 002A85Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios
Auteurs : S. Y. Kung [États-Unis, République populaire de Chine] ; Yuhui Luo [États-Unis] ; Man-Wai MakSource :
- Journal of Signal Processing Systems [ 1939-8018 ] ; 2010-10-01.
English descriptors
- KwdEn :
Abstract
Abstract: An effective data mining system lies in the representation of pattern vectors. For many bioinformatic applications, data are represented as vectors of extremely high dimension. This motivates the research on feature selection. In the literature, there are plenty of reports on feature selection methods. In terms of training data types, they are divided into the unsupervised and supervised categories. In terms of selection methods, they fall into filter and wrapper categories. This paper will provide a brief overview on the state-of-the-arts feature selection methods on all these categories. Sample applications of these methods for genomic signal processing will be highlighted. This paper also describes a notion of self-supervision. A special method called vector index adaptive SVM (VIA-SVM) is described for selecting features under the self-supervision scenario. Furthermore, the paper makes use of a more powerful symmetric doubly supervised formulation, for which VIA-SVM is particularly useful. Based on several subcellular localization experiments, and microarray time course experiments, the VIA-SVM algorithm when combined with some filter-type metrics appears to deliver a substantial dimension reduction (one-order of magnitude) with only little degradation on accuracy.
Url:
DOI: 10.1007/s11265-008-0273-8
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000209
- to stream Istex, to step Curation: 000209
- to stream Istex, to step Checkpoint: 000798
- to stream Main, to step Merge: 002B10
- to stream Main, to step Curation: 002A84
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios</title>
<author><name sortKey="Kung, S Y" sort="Kung, S Y" uniqKey="Kung S" first="S. Y." last="Kung">S. Y. Kung</name>
</author>
<author><name sortKey="Luo, Yuhui" sort="Luo, Yuhui" uniqKey="Luo Y" first="Yuhui" last="Luo">Yuhui Luo</name>
</author>
<author><name sortKey="Mak, Man Wai" sort="Mak, Man Wai" uniqKey="Mak M" first="Man-Wai" last="Mak">Man-Wai Mak</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:DEC7884D0F11AB1F2B7BBF66EAC1290D1D036BB3</idno>
<date when="2008" year="2008">2008</date>
<idno type="doi">10.1007/s11265-008-0273-8</idno>
<idno type="url">https://api.istex.fr/ark:/67375/VQC-RV3XCRW0-V/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000209</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000209</idno>
<idno type="wicri:Area/Istex/Curation">000209</idno>
<idno type="wicri:Area/Istex/Checkpoint">000798</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000798</idno>
<idno type="wicri:doubleKey">1939-8018:2008:Kung S:feature:selection:for</idno>
<idno type="wicri:Area/Main/Merge">002B10</idno>
<idno type="wicri:Area/Main/Curation">002A84</idno>
<idno type="wicri:Area/Main/Exploration">002A84</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios</title>
<author><name sortKey="Kung, S Y" sort="Kung, S Y" uniqKey="Kung S" first="S. Y." last="Kung">S. Y. Kung</name>
<affiliation wicri:level="4"><country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Princeton University, Princeton, NJ</wicri:regionArea>
<orgName type="university">Université de Princeton</orgName>
<placeName><settlement type="city">Princeton (New Jersey)</settlement>
<region type="state">New Jersey</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>National Chung Hsing University, 250 Kuo Kuang Rd., 402, Taichung, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Luo, Yuhui" sort="Luo, Yuhui" uniqKey="Luo Y" first="Yuhui" last="Luo">Yuhui Luo</name>
<affiliation wicri:level="4"><country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Princeton University, Princeton, NJ</wicri:regionArea>
<orgName type="university">Université de Princeton</orgName>
<placeName><settlement type="city">Princeton (New Jersey)</settlement>
<region type="state">New Jersey</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
<author><name sortKey="Mak, Man Wai" sort="Mak, Man Wai" uniqKey="Mak M" first="Man-Wai" last="Mak">Man-Wai Mak</name>
<affiliation><wicri:noCountry code="subField">SAR</wicri:noCountry>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Journal of Signal Processing Systems</title>
<title level="j" type="sub">for Signal, Image, and Video Technology(formerly the Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology)</title>
<title level="j" type="abbrev">J Sign Process Syst</title>
<idno type="ISSN">1939-8018</idno>
<idno type="eISSN">1939-8115</idno>
<imprint><publisher>Springer US; http://www.springer-ny.com</publisher>
<pubPlace>Boston</pubPlace>
<date type="published" when="2010-10-01">2010-10-01</date>
<biblScope unit="volume">61</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="3">3</biblScope>
<biblScope unit="page" to="20">20</biblScope>
</imprint>
<idno type="ISSN">1939-8018</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">1939-8018</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Feature selection</term>
<term>Filter</term>
<term>Genomics</term>
<term>Microarray</term>
<term>Self-supervised</term>
<term>Sequence</term>
<term>Supervised</term>
<term>Unsupervised</term>
<term>Wrapper</term>
</keywords>
</textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: An effective data mining system lies in the representation of pattern vectors. For many bioinformatic applications, data are represented as vectors of extremely high dimension. This motivates the research on feature selection. In the literature, there are plenty of reports on feature selection methods. In terms of training data types, they are divided into the unsupervised and supervised categories. In terms of selection methods, they fall into filter and wrapper categories. This paper will provide a brief overview on the state-of-the-arts feature selection methods on all these categories. Sample applications of these methods for genomic signal processing will be highlighted. This paper also describes a notion of self-supervision. A special method called vector index adaptive SVM (VIA-SVM) is described for selecting features under the self-supervision scenario. Furthermore, the paper makes use of a more powerful symmetric doubly supervised formulation, for which VIA-SVM is particularly useful. Based on several subcellular localization experiments, and microarray time course experiments, the VIA-SVM algorithm when combined with some filter-type metrics appears to deliver a substantial dimension reduction (one-order of magnitude) with only little degradation on accuracy.</div>
</front>
</TEI>
<affiliations><list><country><li>République populaire de Chine</li>
<li>États-Unis</li>
</country>
<region><li>New Jersey</li>
</region>
<settlement><li>Princeton (New Jersey)</li>
</settlement>
<orgName><li>Université de Princeton</li>
</orgName>
</list>
<tree><noCountry><name sortKey="Mak, Man Wai" sort="Mak, Man Wai" uniqKey="Mak M" first="Man-Wai" last="Mak">Man-Wai Mak</name>
</noCountry>
<country name="États-Unis"><region name="New Jersey"><name sortKey="Kung, S Y" sort="Kung, S Y" uniqKey="Kung S" first="S. Y." last="Kung">S. Y. Kung</name>
</region>
<name sortKey="Luo, Yuhui" sort="Luo, Yuhui" uniqKey="Luo Y" first="Yuhui" last="Luo">Yuhui Luo</name>
<name sortKey="Luo, Yuhui" sort="Luo, Yuhui" uniqKey="Luo Y" first="Yuhui" last="Luo">Yuhui Luo</name>
</country>
<country name="République populaire de Chine"><noRegion><name sortKey="Kung, S Y" sort="Kung, S Y" uniqKey="Kung S" first="S. Y." last="Kung">S. Y. Kung</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002A84 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002A84 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Sante |area= MersV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:DEC7884D0F11AB1F2B7BBF66EAC1290D1D036BB3 |texte= Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios }}
This area was generated with Dilib version V0.6.33. |